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1.  DESCRIPTION  AND  OBJECTIVES  OF  RESEARCH  AND  SIGNIFICANT  RESULTS 
DURING  THE  GRANT  PERIOD 

Our  project  stated  the  following  broad  goals: 

a  to  design  a  practical  tool  capable  of  automating  major  aspects  of  programming  -  essentially, 
a  generalization  of  YACC  (26)  and  MACSYMA  [28]  to  facilitate  implementation  of  a  wide 
class  of  complex  nonnumerical  algorithms  (in  addition  to  parsing); 

b  to  design  and  implement  complexity  based  specification  languages; 
c  to  integrate  problem  specification,  program  design,  verification,  and  analysis  within  a  single 
unified  framework; 

d  to  design  and  efficiently  implement  pattern  directed  rule  systems  for  semantic  analysis; 
e  to  make  it  easier  to  teach  and  understand  algorithms  and  software  engineering. 

All  of  the  goals  stated  in  the  original  proposal  were  fulfilled.  The  specific  achievements 
during  the  grant  period  may  be  divided  into  four  categories  described  below. 


i.l.  Transformational  Methodology 


Underlying  our  transformational  methodology  is  the  hypothesis  that  much  of  the  difficulty 
involved  in  implementing  complex  nonnumerical  algorithms  is  due  to  human  factors  that  are 
unrelated  to  any  inherent  problem  complexity  and  can  be  solved  by  specific  transformations 
implemented  in  APTS.  These  factors  stem  largely  from  the  extensive  level  of  precise  detail 
needed  (1)  to  craft  efficient  program  loops  (2)  to  write  the  essential  imperative  bookkeeping 
operations  that  serve  to  maintain  program  invariants,  and  (3)  to  implement  tedious  but  necessary 
physical  data  structures  that  implement  conceptually  simpler  set  and  map  operations.  Although 
this  crafting  is  error  prone  if  done  by  hand,  our  transformations  allow  us  to  avoid  having  to 
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either  pointer  or  cursor  access.  These  methods  show  the  surprising  fact  that  most  of  the  algo¬ 
rithms  that  seem  to  rely  on  arrays  and  array  access  in  the  main  algorithm  texts  can  be  realized 
with  the  same  theoretical  performance  using  just  pointers  and  pointer  access.  These  techniques 
also  have  substantial  applications  to  software  design  and  high  level  language  implementation  that 
could,  for  example,  facilitate  an  efficient  implementation  of  Willard’s  RCS  database  language 
[45,46,47]. 

Although  the  linear  time  language  mentioned  earlier  is  a  broad  based  automatic  method  for 
producing  software,  it  has  sharp  ramifications  to  algorithm  design  as  well.  For  example,  we 
showed  that  the  very  basic  problem  of  Propositional  Horn  Clause  Satisfiability  can  be  expressed 
in  our  linear  time  language,  thereby  ensuring  a  new  linear  time  pointer  machine  solution. 

Our  two  articles  with  Tarjan  on  applications  and  improvements  to  partition  refinement 
strategies  [34,35]  and  our  more  recent  article  with  Bloom  [4]  exhibit  the  power  of  the  dominated 
convergence  argument  for  computing  fixed  points.  One  of  these  applications  remains  the  best 
solution  to  the  basic  problem  of  lexicographic  sorting.  Two  of  the  other  applications  to  ready 
simulation  and  strong  bisimulation  have  had  an  impact  on  the  field  of  distributed  computing. 

Our  recent  work  with  Chang  on  processing  regular  expressions  [14]  exhibits  our  differential 
transformations  applied  in  a  new  functional  language  context.  In  CPM  ’92  we  gave  theoretically 
and  computationally  superior  solutions  for  the  classical  problems  of  (1)  transforming  regular 
expressions  into  nondeterministic  finite  state  automata,  (2)  turning  nondeterministic  finite  state 
automata  into  deterministic  finite  state  automata,  (3)  acceptance  testing  in  nondeterministic  finite 
state  automata,  and  (4)  regular  expression  search.  These  problems  are  important,  because  of  the 
many  practical  applications,  including  compilation  of  communicating  processes,  string  pattern 
matching,  model  checking,  lexical  scanning,  and  VLSI  layout  design. 

1.3.  Transformational  System 

In  order  to  mechanize  the  transformational  methodology,  we  decided  to  design  a  transfor¬ 
mational  programming  system  with  bottom-up  tree  pattern  matching  as  a  fundamental  operation. 
Since  Hoffmann  and  O’Donnell’s  seminal  work  in  1982  [25],  bottom-up  tree  matching  has  been 
regarded  as  important  but  very  difficult,  with  no  obvious  way  to  make  either  theoretically  or 
computationally  tractable. 

In  CAAP  ’90  Cai,  myself,  and  Tarjan  presented  the  first  theoretical  improvement  to  Hoff¬ 
mann  and  O’Donnell,  and  our  paper  was  .selected  for  publication  in  a  special  issue  of  Theoretical 
Computer  Science  on  best  papers  from  the  conference  [11].  This  paper  included  an  algorithm  for 
pattern  matching,  where  a  newly  propo,sed  on-line  algorithm  outperforms  a  batch  algorithm  that 
was  its  direct  predecessor. 

In  1991  we  completed  the  APTS  transformational  system  at  the  University  of  Wisconsin. 
We  used  APTS  to  build  two  compilers  for  conducting  experiments  to  test  the  feasibility  of  these 
techniques.  One  of  these  compilers  translates  a  major  fragment  of  SETL2  into  C.  The  other 
translates  SQ2+  (a  strongly  typed  functional  subset  of  SETL2  augmented  with  fixed  point 
expressions)  into  C. 

The  APTS  system  comprises  about  15,()(K)  lines  of  SETL2,  a  nice  manageable  size  encour¬ 
aging  further  development.  SETL2  is  written  in  C,  which  makes  APTS  portable  to  more 


that  this  is  greatly  improvable. 

These  experimental  results  are  highly  encouraging.  The  latest  experiments  compared 
benchmarks  for  implementations  of  10  different  algorithms.  For  nine  of  these  algorithms  bench¬ 
marks  were  made  comparing  executable  low  level  SETL2  specifications  with  C  codes  automati¬ 
cally  generated  from  them  (see  Fig.  1).  The  sizes  of  the  SETL2  specifications  ranged  from  25 
lines  for  a  maximal  independent  set  algorithm  to  103  lines  for  Floyd’s  implementation  [21]  of 
Dijkstra’s  single  source  shortest  path  algorithm  [18]  with  2-heaps  (see  Fig.  2).  The  Generated  C 
programs  ranged  fi-om  133  lines  for  graph  reachability  to  766  lines  for  Floyd’s  algorithm.  All 
operations  were  explicitly  written  into  the  source  code  (e.g.  heap  operations  were  written  explic¬ 
itly  into  the  SETL2  specification  of  Floyd);  there  were  no  hidden  library  routines.  Source  lines 
were  measured  from  prettyprinted  text  (free  from  comments)  using  the  UNIX  wc  function. 
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Based  on  assumptions  described  in  [13],  we  measure  the  increa.se  in  productivity  by  the 
number  of  source  lines  of  the  compiled  C  program  divided  by  the  number  of  source  lines  of 


faster  than  the  SETL2  program  generated  from  it,  and  has  75  times  more  source  lines  than  the 
specification.  It  is  interesting  to  note  firom  Fig.  1  that  the  attribute  closure  C  program  generated 
by  our  SETL-to-C  translator  from  a  much  more  detailed  low  level  SETL  specification  runs  no 
faster  than  the  C  program  generated  from  the  SQ2+  specification.  It  is  also  interesting  to  note 
that  if  Kirk  Snyder’s  and  our  estimates  that  his  SETL2  is  about  30  times  slower  than  hand  coded 
C  is  accurate,  then  the  performance  of  our  generated  C  program  would  be  comparable  to  good 
hand  code. 

Finally,  an  experiment  was  performed  to  test  the  potential  for  scaled  up  applications. 
Raytheon,  an  industrial  affiliate  of  the  DOD-sponsored  prototech  projects,  provided  the  investi¬ 
gators  with  a  moderate  sized  prototype  software  package  to  generate  tracks  from  input  radar  plot 
data  using  an  alpha-beta  filter  with  constant  scalar  coefficients.  "The  program  processes  a  file  of 
radar  plot  data  and  produces  a  Postscript  output  file  which  shows  the  original  plots  and  program 
generated  radar  tracks  superimposed  on  a  plan  of  the  radar  site."  The  Raytheon  code,  which 
comprises  9  packages  and  classes  and  over  1000  lines  of  source,  makes  use  of  every  aspect  of 
SETL2.  The  experiment  involved  translating  the  whole  main  SETL2  module  into  C,  which 
would  interoperate  with  the  remaining  SETL2  modules.  The  productivity  improvement  (i.e.,  477 
lines  of  generated  C  plus  SETL2  interoperability  code  divided  by  64  lines  in  the  SETL2  main 
module)  was  7.7.  The  overall  speedup  was  2.6,  which  was  surprisingly  good,  considering  the 
tremendous  overhead  in  the  way  that  SETL2  currently  implements  interoperability  with  C. 
Overhead  is  due  to  added  levels  of  procedure  calls,  and  massive  datatype  conversion,  because 
only  strings  are  currently  allowed  to  pass  between  C  and  SETL2.  Recall  also,  that  only  one  out 
of  the  9  modules  was  translated  into  C. 

1.5.  Related  Work 

The  SETL  research  program  directed  by  Jack  Schwartz  from  about  1971  until  1989  has  had 
the  most  positive  influence  on  our  work.  Schwartz  effectively  demonstrated  the  conceptual  sim¬ 
plicity  of  the  SETL  language  with  its  rich  repertoire  of  universal  set  theoretic  dictions  [38],  and 
introduced  many  important  concepts  in  an  attempt  to  solve  the  formidable  task  of  implementa¬ 
tion  and  optimization.  SETL’s  mathematical  value  semantics  made  costly  hidden  copy  opera¬ 
tions  a  problem.  Dynamically  allocated  aggregate  datatypes  with  arbitrary  levels  of  nesting 
made  memory  management  difficult.  Dynamic  weak  typing,  massive  overloading,  and  the 
prevalence  of  operations  that  relied  on  associative  access  compounded  the  run-time  costs  and  the 
complexity  of  an  efficient  implementation. 

The  SETL  optimizer  [37,23]  dealt  with  the  hidden  copy  problem  by  making  a  conservative 
estimate  of  when  a  set  could  be  updated  de.structively  or  not,  and  by  using  a  run-time  bit  (that 
could  be  turned  on  but  not  off)  to  indicate  this  fact.  Snyder’s  SETL2  implementation  uses  refer¬ 
ence  counts  for  the  same  purpose.  The  SETL  system  relied  on  compacting  garbage  collection  for 
dynamic  allocation,  whereas  SETL2  uses  collections  of  stacks.  SETL  used  an  elaborate  hashing 
mechanism  as  a  default  implemention  of  associative  access  for  arbitrary  datatypes.  Its  optimized 
data  structures  combined  hashing  with  bit  vectors  and  plex  structures  to  reduce  the  amount  of 
replicated  data  and  the  expense  of  ha.shing  [39,17,37).  SETL2  uses  tries,  and  attempts  no  com¬ 
pile-time  optimization.  SETL  attempted  to  generate  more  efficient  code  using  type  analysis  with 
an  accuracy  up  to  three  levels  of  nesting  (44|,  for  sets,  maps,  and  tuples  (in  order  to  guarantee 


Finally,  there  were  conceptual  as  well  as  engineering  problems  with  the  SETL  data  struc¬ 
ture  optimization.  The  fundamental  data  structure  that  formed  the  spine  of  a  data  structure 
aggregate  was  the  hash  table.  Hashing  was  used  heavily  in  inputting  data,  in  forming  the  initial 
optimized  data  structures,  and  was  used  to  implement  any  set  that  was  iterated  over.  The  heuris¬ 
tic  nature  of  the  SETL  optimizations  and  the  unpredictability  of  expensive  hidden  operations  was 
observed  by  both  Shields  and  Straub. 

Our  real-time  simulation  provides  new  theoretically  sound  conceptual  underpinnings  to  data 
structure  selection  with  precise  guarantees  of  speedup.  Our  transformation  is  grounded  in  basic 
algorithm  theory  by  forming  data  structure  aggregates  from  arrays  and  linked  lists  instead  of 
hash  tables.  Iteration  over  a  SETL2  set  does  not  force  us  to  resort  to  hashing.  Our  routine  to 
input  data  and  to  form  initial  data  structures  is  highly  efficient  without  making  use  of  hashing. 
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